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COMMERCE T/ 



TRANSMITTAL LETTER TO THE UNITED STATES 
DESIGNATED/ELECTED OFFICE (DO/EO/US) 
CONCERNING A FILING UNDER 35 U.S.C. 371 



ATTORNEY 'S DOCKET NUMBER 

205502-9004 



U S. APPLICATION NO. (If known, see 37 CER 1 5 

09/890604 



INTERNATIONAL APPLICATION NO. 
PCT/CAOO/00074 



INTERNATIONAL FILING DATE 
January 31, 2000 



PRIORITY DATE CLAIMED 
February 1, 1999 



TITLE OF INVENTION 

Modification of Lignin Composition of Gvmnosperms 



APPLICANT(S) FOR DO/EO/US 

ELLIS, David Dunham; CHAPPLE. Clinton Charles Spencer: GILBERT. Margarita 



Applicant herewith submits to the United States Designated/Elected Office (DO/EO/US) the following items and other information: 

1. }5| This is a FIRST submission of items concerning a filing under 35 U.S.C. 371. 

2. □ This is a SECOND or SUBSEQUENT submission of items concerning a filing under 35 U.S.C. 371 . 

3 n This is an express request to begin national examination procedures (35 U.S.C. 37 1(f)). The submission must include 
items (5), (6), (9) and (2 1 ) indicated below. 

4. Q The US has been elected by the expiration of 19 months from the priority date (Article 3 1). 

5.]0 A copy of the International Application as filed (35 U.S.C. 371(c)(2)) 
» a. & is attached hereto (required only if not communicated by the International Bureau). 
=i b. Q has been communicated by the International Bureau. 

1 c. fj] is not required, as the application was filed in the United States Receiving Office (RO/US). 

=i 6. □ An English language translation of the International Application as filed (35 U.S.C. 371(c)(2)). 
:l a. Q ' s attached hereto. 

i; b. □ has been previously submitted under 35 U.S.C. 154(d)(4). 

if 7. ?3 Amendments to the claims of the International Aplication under PCT Article 19 (35 U.S.C. 371(c)(3)) 

a. □ are attached hereto (required only if not communicated by the International Bureau). 

b. |XJ have been communicated by the International Bureau. 

\ c . □ have not been made; however, the time limit for making such amendments has NOT expired. 

I d. Q have not been made and will not be made. 

* 8. □ An English language translation of the amendments to the claims under PCT Article 19 (35 U.S.C. 371 (c)(3)). 
j 9. □ An oath or declaration of the inventor(s) (35 U.S.C. 371(c)(4)). 

1 0. □ An English lanugage translation of the annexes of the International Preliminary Examination Report under PCT 

Article 36 (35 U.S.C. 371(c)(5)). 

Items 11 to 20 below concern document(s) or information included: 

1 1 . Q An Information Disclosure Statement under 37 CFR 1 .97 and 1 .98. 

12. □ An assignment document for recording. A separate cover sheet in compliance with 37 CFR 3 .28 and 3.3 1 is included. 

1 3 . f§ A FIRST preliminary amendment. 

14. □ A SECOND or SUBSEQUENT preliminary amendment. 

1 5 . rj A substitute specification. 

16. A change of power of attorney and/or address letter. 

17. f_] A computer-readable form of the sequence listing in accordance with PCT Rule 13ter.2 and 35 U.S.C. 1.821 - 1.825. 

18. A second copy of the published international application under 35 U.S.C. 154(d)(4). 

19. □ A second copy of the English language translation of the international application under 35 U.S.C. 154(d)(4). 

20. ^] Other items or information: See attached sheet. 



n 1 MEflffl 




Independent claims 



MULTIPLE DEPENDENT CLMM(S) (if applicable) 



2 1 . [X] The following fees are submitted: 
BASIC NATIONAL FEE (37 CFR 1.492 (a) (l)-(S)): 
Neither international preliminary examination fee (37 CFR 1 .482) 
nor international search fee (37 CFR 1.445(aY™ - TTOB ™ 
and International Search Report not prepare 1 



>aid to USPTO 
EPOorJPO $1000.00 



International preliminary examination fee (37 CFR 1.482) not paid to 

USPTO but International Search Report prepared by the EPO or JPO S860.00 

International preliminary examination fee (37 CFR 1.482) not paid to USPTO 

but international search fee (37 CFR 1.445(a)(2)) paid to USPTO $710.00 

International preliminary examination fee (37 CFR 1.482) paid to USPTO 

but all claims did not satisfy provisions of PCT Article 33(l)-(4) $690.00 

International preliminary examination fee (37 CFR 1.482) paid to USPTO 

and all claims satisfied provisions of PCT Article 33(l)-(4) $100.00 

ENTER APPROPRIATE BASIC FEE AMOUNT = 



Surcharge of $130.00 for furnishing the oath or declaration later than □ 20 □ 30 
months from the earliest claimed priority date (37 CFR 1.492(e)). 



NUMBER FI LED 
5* 



NUMBER EXTRA 



TOTAL OF ABOVE CALCULATIONS 



|7| Applicant claims small entity status. See 37 CFR 1.27. The fees indicated above 
^ are reduced by 1/2. 



$ 860.00 



$612. UU 
$ 480.00 



SUBTOTAL = 



Processing fee of $130.00 for furnishing the English translation later than □ 20 □ 3 0 
months from the earliest claimed priority date (37 CFR 1.492(f)). 



TOTAL NATIONAL FEE 



Fee for recording the enclosed assignment (37 CFR 1 .21(h)). The assignment must be 
accompanied by an appropriate cover sheet (37 CFR 3.28, 3.3 1). $40.00 per property 



TOTAL FEES ENCLOSED 



$ 1,111.00 



charged: 



a. £2] A check in the amount of $ 1,111.00 

b. I — I Please charge my Deposit Account No. 

' — ' A HunlifatA ™v nf this shppt is p.nclos 



;r the above fees is enclosed, 
in the amount of $ 



o cover the above fees. 



A duplicate copy of this sheet is enclosed. 



c. £3 The Commissioner is hereby authorized to charge any additional fees which may be required, or credit any 

overpayment to Deposit Account No. 12-0064 . A duplicate copy of this sheet is enclosed. 

d. Q Fees are to be charged to a credit card. WARNING: Information on this form may become public. Credit card 

information should not be included on this form. Provide credit card information and authorization on PTO-2038. 



NOTE: Where an appropriate time limit under 37 CFR 1.494 or 1.495 has not been met, a petition to revive (37 CFR 
1.137 (a) or (b)) must be filed and granted to restore the application to pending status. 

SEND ALL CORRESPONDENCE TO: JiAtfG 

MICHAEL BEST & FRIEDRICH LLC *S^tui 
401 N. Michigan Ave., Suite 1700 Lisa C 



Chicago, IL 60611 name 

39937 

REGISTRATION NUMBER 



iC17Bec'cfPCT7PTD 0 1 AUG 2001 

UNITED STATES PATENT & TRADEMARK OFFICE Q 9 / 8 90 604 



IN THE 



IN RE APPLICATION OF : ELLIS, et al. 

DOCKET NO.: 205502-9004 

Based on International Publication No. WO 00/46382 

Published August 10, 2000 

International Application No. PCT/CA00/00074 

Filed January 31, 2000 

Priority Date Claimed: February 1, 1999 

FILED ON: HEREWITH 

FOR: Modification of Lignin Composition of Gymnosperms 



Art Group Unit: 
Examiner: 

FIRST PRELIMINARY 
AMENDMENT 



10: 



ASSISTANT COMMISSIONER OF PATENTS 
WASHINGTON, D.C. 20231 



AUTHORIZATION TO PAY AND PETITION FOR THE ACCEPTANCE OF ANY NECESSARY FEES: If any charges 
or fees must be paid in connection with the following Communication (including but not limited to the payment of issue 
fees), they may be paid out of our deposit account No. 12-0064. If this payment also requires a Petition, please construe this 
authorization to pay as the necessary Petition which is required to accompany the payment. 

Applicant herewith petitions the Commissioner of Patents and Trademarks to extend the time for response to the Office 
Action dated for month(s) from to 



. Submitted herewith is check No. _ 



_for $_ 



_ to cover the cost of the extension. 



If a check is lost, or otherwise does not accompany this Petition, please charge my deposit account number 12-0064 in the 
appropriate amount to cover the cost of the extension. Any deficiency or overpayment should be charged or credited to the 
above numbered deposit account. 

FIRST PRELIMINARY AMENDMENT 



CERTIFICATE OF MAILING 

I hereby certify that this correspondence is being 
deposited with the United States Postal Service as 
first class mail in an envelope addressed to: 
Assistant Commissioner of Patents, Washington, 
D.C. 20231 on: 

Date: 

Signature: 

Print: 



MICHAEL, BEST & FRIEDRICH L.L.C. 
401 N. Michigan Avenue, Suite 1700 
Chicago, IL 60611-2412 
Telephone: (312)661-2100 Facsimile: (312)661-0029 



Attorney Docket No. 205502-9004 
Serial No. NOT YET ASSIGNED 



FIRST PRELIMINARY AMENDMENT 



LV. ChilHs^V 
Atty. Registration No. 39937 
MICHAEL BEST & FRIEDRICH LLC 
401 N. Michigan Avenue, Suite 1700 
Chicago, IL 6061 1-2412 
Phone: (312)661-2100 
Fax: (312)661-0029 



Please cancel claims 10 and 12. 



Respectfully submitted, 
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IN THE 

UNITED STATES PATENT & TRADEMARK OFFICE 



$3 



IN RE APPLICATION OF: 
CASE: 

INT'L APPL'N NO. 
INT'L FILING DATE 
U.S. SERIAL NO.: 
FILED ON: 
FOR: 



ELLIS, et al. 

205502-9004 

PCT/CA00/00074 

31 JAN 2000 

09/890,604 

01 AUG 2001 

Modification of Lignin 
Composition of Gymnosperms 



RESPONSE TO: 

NOTIFICATION OF 
MISSING 

REQUIREMENTS UNDER 
35 U.S.C. 371 IN THE 
UNITED STATES DO/EO 



BOX: PCT 

ASSISTANT COMMISSIONER OF PATENTS 

UNITED STATES PATENT AND TRADEMARK OFFICE 

WASHINGTON, DC 20231 

Dear Sirs: 

[X] AUTHORIZATION TO PAY AND PETITION FOR THE ACCEPTANCE OF ANY NECESSARY 

FEES: If any charges or fees must be paid in connection with the following Communication (including but 
not limited to the payment of issue fees), they may be paid out of our deposit account No. 50-1965. If this 
payment also requires a Petition, please construe this authorization to pay as the necessary Petition which 
is required to accompany the payment. 

fx] Applicant herewith petitions the Commissioner of Patents and Trademarks to extend the time for response 

to the Office Action dated October 2, 2001 for three (3) month(s) from December 2, 2001 to March 2, 
2002. Submitted herewith is check No. 12612 which includes the $ 460.00 extension fee. If a check is 
lost, or otherwise does not accompany this Petition, please charge my deposit account number 50-1965 in 
the appropriate amount to cover the cost of the extension. Any deficiency or overpayment should be 
charged or credited to the above numbered deposit account. 



EXPRESS MAIL LABEL NO. EL 874 050 187 US 

Date of Deposit March 1. 2002 

I hereby certify that the above listed papers or fees are being 
deposited with the United States Postal Service "Express 
Mail Post Office to Addressee" service under 37 CFR 1.10 
On the date indicated above, addressed to the Commissioner 
of Patents and Trademarks, Washington, D.C. 20231. The 
person mailing these papers/fe 

SIGNATURE" ' 

Michael A. Kmnegarf 



MICHAEL BEST & FRIEDRICH LLC 
401 North Michigan Avenue, Suite 1900 
Chicago, Illinois 60611-4212 
(312) 661-2100 Fax: (312) 661-0029 



RESPONSE TO NOTIFICATION OF MISSING REQUIREMENTS 
UNDER 35 U.S.C. 371 IN THE UNITED STATES DO/EO 



This is in response to the Notification of Missing Requirements Under 35 U.S.C. 371 in the United 
States DO/EO dated October 2, 2001. 

Applicant hereby submits the Oath and Declaration, signed by each of the inventors, along with 
check number 12612 in the amount of $ 525.00 which includes the required surcharge of $ 65.00. 

Accompanying this submission is Applicants' Sequence Listing in computer readable form as 
required underr 37 C.F.R. 1.821(e). 



Applicant hereby states that the Sequence Listing submitted hereto as Exhibit A is recorded in 
computer readable form, that it is identical to the Sequence Listing submitted herewith on 3.5" floppy 
diskette, and that it does not include new matter which goes beyond the disclosure in the International 
Application. 



STATEMENT UNDER 37 C.F.R. 1.821(e-g), 1.825(b), and 1.825 (d) 




Respectfully submitted, 




03/07/8002 HNGIJYEN 00000073 09890604 




Charles A>Laf£ 
Reg. No. 19787 
Lisa C. Childs 
Reg. No. 39937 

LAFF, WHITESEL & SARET, LTD. 
401 North Michigan Avenue, Suite 1700 
Chicago, IL 60611 
(312)661-2100 
(312) 222-0818 (fax) 



Agents for Applicant 
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RAW SEQUENCE LISTING DATE: 03/22/2002 

PATENT APPLICATION: US/09/890,604 TIME: 11:40:36 

Input Set : A:\41193-~l.txt 

Output Set: N:\CRF3\03222002\l890604.raw 

3 <110> APPLICANT: Ellis, David D 

4 Chappie, Clinton C S 

5 Gilbert, Margarita 

7 <120> TITLE OF INVENTION: Modification of Lignin Composition of Gymnosperms 
9 <130> FILE REFERENCE: 41193-B 
11 <140> CURRENT APPLICATION NUMBER: 09/890,604 
C--> 12 <141> CURRENT FILING DATE: 2002-03-01 

14 <150> PRIOR APPLICATION NUMBER: US 60/118,124 

15 <151> PRIOR FILING DATE: 1999-02-01 

17 <150> PRIOR APPLICATION NUMBER: PCT/CAOO/00074 

18 <151> PRIOR FILING DATE: 2000-01-31 
20 <160> NUMBER OF SEQ ID NOS : 2 

22 <170> SOFTWARE: Patentln version 3.1 

24 <210> SEQ ID NO: 1 

25 <211> LENGTH: 5156 
2 6 <212> TYPE: DNA 

27 <213> ORGANISM: Arabidopsis thaliana 
2 9 <220> FEATURE: 

30 <221> NAME/KEY: CDS 

31 <222> LOCATION: ( 2487 )..( 3002 ) 

32 <223> OTHER INFORMATION: 

35 <2 2 0> FEATURE: 

36 <221> NAME/KEY: CDS 

37 <222> LOCATION: { 3902 )..( 4519 ) 

38 <22 3> OTHER INFORMATION: 

41 <22 0> FEATURE: 

42 <221> NAME/KEY: CDS 

43 <222> LOCATION: ( 3131 )..( 3556 ) 

44 <223> OTHER INFORMATION: 

47 <400> SEQUENCE: 1 

48 aagcttatgt atttccttat aaccatttta ttctgtatat agggggacag aaacataata 60 
50 agtaacaaat agtggtttta tttttttaaa tatacaaaaa ctgtttaacc attttatttc 120 
52 ttggttagca aaattttgat atattcttaa gaaactaata ttttaggttg atatattgca 180 
54 gtcactaaat agttttaaaa gacacgaagt tggtaagaac aggcatatat tattcgattt 240 
56 aattaggaat gcttatgtta atctgattcg actaattaga aacgacgata ctatgagctc 300 
58 atagatggtc ccacgaccca ctctcccatt tgatcaatat tcaactgagc aatgaaacta 360 
60 attaaaaacg tggttagatt aaaaaaataa attgtgcagg tagcggatat ataatactag 420 
62 taggggttaa aaataaaata aaacaccaca gtattaaatt tttgtttcaa aagtattatc 480 
64 aatagttttt ttgcttcaaa aatatcacaa atttttgtat gaaatatttc tttaacgaaa 540 
66 ataaattaaa taaaatttaa aatttatatt tggagttcta tttttaattt agagttttta 600 
68 ttgttaccac attttttgaa ttattctaat attaatttgt gatattatta caaaaagtaa 660 
70 aaatatgata ttttagaata ctattatcga tatttgatat tattgacctt agctttgttt 720 
72 gggtggagac atgtgattat cttattacct ttttattcca tgaaactaca gagttcgcca 780 
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RAW SEQUENCE LISTING DATE: 03/22/2002 

PATENT APPLICATION: US/09/890 , 604 TIME: 11:40:36 

Input Set : A:\41193-l.txt 

Output Set: N:\CRF3\03222002\l890604.raw 

74 ggtaccatac atgcacacac cctcgtgaag ccgtgactta atatgatcta gaacttaaat 840 
76 agtactacta attgtgtcat ttgaactttc tcctatgtcg gtttcacttc atgtatcgca 900 
7 8 gaacaggtgg aatacagtgt ccttgagttt cacccaaatc ggtccaattt tgtgatatat 960 

80 attgcgatac agacatacag cctacagagt tttgtcttag cccactggtt ggcaaacgaa 1020 

82 attgtcttta tttttttatg ttttgttgtc aatgtgtctt tgtttttaac tagattgagg 1080 

84 tttaatttta atacatttgt tagtttacag attatgcagt gtaatctgat aatgtaagtt 1140 

86 gaactgcgtt ggtcaaagtc ttgtgtaacg cactgtatct aaattgtgag taacgacaaa 1200 

88 ataattaaaa ttaaaggacc ttcaagtatt attagtatct ctgtctaaga tgcacaggta 12 60 

90 ttcagtaata gtaataaata attacttgta taattaatat ctaattagta aaccttgtgt 1320 

92 ctaaacctaa atgagcataa atccaaaagc aaaaatctaa acctaactga aaaagtcatt 1380 

94 acgaaaaaaa gaaaaaaaaa agagaaaaaa ctacctgaaa agtcatgcac aacgttcatc 1440 

i 96 ttggctaaat ttatttagtt tattaaatac aaaaatggcg agtttctgga gtttgttgaa 1500 

: 98 aatatatttg tttagccact ttagaatttc ttgttttaat ttgttattaa gatatatcga 1560 
; 100 gataatgcgt ttatatcacc aatatttttg ccaaactagt cctatacagt catttttcaa 1620 
' 102 cagctatgtt cactaattta aaacccactg aaagtcaatc atgattcgtc atatttatat 1680 
' 104 gctcgaattc agtaaaatcc gtttggtata ctatttattt cgtataagta tgtaattcca 174 0 
I 106 ctagatttcc ttaaactaaa ttatatattt acataattgt tttctttaaa agtctacaac 1800 
= 108 agttattaag ttataggaaa ttatttcttt tatttttttt tttttttagg aaattatttc 1860 
; 110 ttttgcaaca catttgtcgt ttgcaaactt ttaaaagaaa ataaatgatt gttataattg 1920 
; 112 attacatttc agtttatgac agattttttt tatctaacct ttaatgtttg tttccctgtt 1980 
114 tttaggaaaa tcataccaaa atatatttgt gatcacagta aatcacggaa tagttatgac 204 0 
; 116 caagattttc aaagtaatac ttagaatcct attaaataaa cgaaatttta ggaagaaata 2100 
; 118 atcaagattt taggaaacga tttgagcaag gatttagaag atttgaatct ttaattaaat 2160 
i! 120 attttcattc ctaaataatt aatgctagtg gcataatatt gtaaataagt tcaagtacat 2220 
! 122 gattaatttg ttaaaatggt tgaaaaatat atatatgtag attttttcaa aaggtatact 228 0 
• 124 aattattttc atattttcaa gaaaatataa gaaatggtgt gtacatatat ggatgaagaa 2 34 0 
i 126 atttaagtag ataatacaaa aatgtcaaaa aaagggacca cacaatttga ttataaaacc 2400 
\ 128 tacctctcta atcacatccc aaaatggaga actttgcctc ctgacaacat ttcagaaaat 2460 

130 aatcgaatcc aaaaaaaaca ctcaat atg gag tct tct ata tea caa aca eta 2 513 

131 Met Glu Ser Ser He Ser Gin Thr Leu 

132 1 5 

134 age aaa eta tea gat ccc acg acg tct ctt gtc ate gtt gtc tct ctt 2561 

135 Ser Lys Leu Ser Asp Pro Thr Thr Ser Leu Val He Val Val Ser Leu 

136 10 15 20 25 

138 ttc ate ttc ate age ttc ate aca egg egg cga agg cct cca tat cct 2 609 

139 Phe He Phe He Ser Phe lie Thr Arg Arg Arg Arg Pro Pro Tyr Pro 

140 30 35 40 

142 ccc ggt cca cga ggt tgg ccc ate ata ggc aac atg tta atg atg gac 2657 

143 Pro Gly Pro Arg Gly Trp Pro He He Gly Asn Met Leu Met Met Asp 

144 45 50 55 

146 caa etc ace cac cgt ggt tta gee aat tta get aaa aag tat ggc gga 2705 

14 7 Gin Leu Thr His Arg Gly Leu Ala Asn Leu Ala Lys Lys Tyr Gly Gly 

148 60 65 70 

150 ttg tgc cat etc cgc atg gga ttc etc cat atg tac get gtc tea tea 2 753 

151 Leu Cys His Leu Arg Met Gly Phe Leu His Met Tyr Ala Val Ser Ser 

152 75 80 85 

154 ccc gag gtg get cga caa gtc ctt caa gtc caa gac age gtc ttc teg 2 801 

155 Pro Glu Val Ala Arg Gin Val Leu Gin Val Gin Asp Ser Val Phe Ser 

156 90 95 100 105 
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Val 


Asp 


Asp 


Leu 


Leu 


Ala 


Phe 


Tyr 


Ser Glu 


Glu 


Ala 


Lys 


Leu 


Val 






210 








280 










285 








290 










212 


agt 


gag 


aca 


gcg 


gat 


ctt 


caa 


aat 


tec 


ate aaa 


ctt 


acc 


cgt 


gac 


aat 


3538 




213 


Ser 


Glu 


Thr 


Ala 


Asp 


Leu 


Gin 


Asn 


Ser 


He Lys 


Leu 


Thr 


Arg 


Asp 


Asn 






214 






295 










300 








305 












216 


ate 


aaa 


gca 


ate 


ate 


atg 


gtaattatat ttcaaaaagc actagtcata 




3586 




217 


He 


Lys 


Ala 


He 


He 


Met 
























218 




310 
































220 


gtcatgtttc ttaatgcgtt aegtaataat acttatccat 


tgaccagtta ttttctccta 


3646 




222 


agtttttttg tttgaattag gaaggtaatt 


ttctatttta 


ctagagaaag caacagattt 


3706 




224 


tagcatgatc tttttttaat atatatagaa 


gcattgaata 


ttcagatcta caataattat 


3766 
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226 


gaaactaatg aagagacas 


la aaatggagac 


agaaaaaaga 


aagagtggac tagtgtggat 


3826 




228 


atatttaatt ctaatttgat tttattagga 


cgttatattt 


aattctaatt tgattttttt 


3886 




230 


atttgatttt attag gac gtt atg ttt 


gga gga acg gaa acg gta gcg teg 


3937 




231 










Asp Val Met Phe Gly Gly Thr Glu Thr Val Ala Ser 






232 










315 








320 








325 






234 


gcg 


ata 


gag 


tgg 


gcc 


tta 


acg 


gag 


tta 


tta 


egg 


age 


ccc 


gag 


gat 


eta 


3985 




235 


Ala 


He 


Glu 


Trp 


Ala 


Leu 


Thr 


Glu 


Leu 


Leu 


Arg 


Ser 


Pro 


Glu 


Asp 


Leu 






236 








330 










335 










340 










238 


aaa 


egg 


gtc 


caa 


caa 


gaa 


etc 


gcc 


gaa 


gtc 


gtt 


gga 


ctt 


gac 


aga 


cga 


4033 




239 


Lys 


Arg 


Val 


Gin 


Gin 


Glu 


Leu 


Ala 


Glu 


Val 


Val 


Gly 


Leu 


Asp 


Arg 


Arg 






240 






345 










350 










355 












242 


gtt 


gaa 


gaa 


tec 


gac 


ate 


gag 


aag 


ttg 


act 


tat 


etc 


aaa 


tgc 


aca 


etc 


4081 




243 


Val 


Glu 


Glu 


Ser 


Asp 


He 


Glu 


Lys 


Leu 


Thr 


Tyr 


Leu 


Lys 


Cys 


Thr 


Leu 






244 




360 










365 










370 














246 


aaa 


gaa 


acc 


eta 


agg 


atg 


cac 


cca 


ccg 


ate 


cct 


etc 


etc 


etc 


cac 


gaa 


4129 




247 


Lys 


Glu 


Thr 


Leu 


Arg 


Met 


His 


Pro 


Pro 


He 


Pro 


Leu 


Leu 


Leu 


His 


Glu 






248 


375 










380 










385 










390 






250 


acc 


gcg 


gag 


gac 


act 


agt 


ate 


gac 


ggt 


ttc 


ttc 


att 


ccc 


aag 


aaa 


tct 


4177 


".3 


251 


Thr 


Ala 


Glu 


Asp 


Thr 


Ser 


He 


Asp 


Gly 


Phe 


Phe 


He 


Pro 


Lys 


Lys 


Ser 




11 


252 










395 










400 










405 






,3 


254 


cgt 


gtg 


atg 


ate 


aac 


gcg 


ttt 


gcc 


ata 


gga 


cgc 


gac 


cca 


acc 


tct 


tgg 


4225 


s 


255 


Arg 


Val 


Met 


He 


Asn 


Ala 


Phe 


Ala 


He 


Gly 


Arg 


Asp 


Pro 


Thr 


Ser 


Trp 






256 








410 










415 










420 








^ 


258 


act 


gac 


ccg 


gac 


acg 


ttt 


aga 


cca 


teg 


agg 


ttt 


ttg 


gaa 


ccg 


ggc 


gta 


4273 




259 


Thr 


Asp 


Pro 


Asp 


Thr 


Phe 


Arg 


Pro 


Ser 


Arg 


Phe 


Leu 


Glu 


Pro 


Gly 


Val 




M 


260 






425 










430 










435 










"3 


262 


ccg 


gat 


ttc 


aaa 


ggg 


age 


aat 


ttc 


gag 


ttt 


ata 


ccg 


ttc 


ggg 


teg 


ggt 


4321 




263 


Pro 


Asp 


Phe 


Lys 


Gly 


Ser 


Asn 


Phe 


Glu 


Phe 


He 


Pro 


Phe 


Gly 


Ser 


Gly 




9 


264 




440 










445 










450 














266 


cgt 


aga 


teg 


tgc 


ccg 


ggt 


atg 


caa 


eta 


ggg 


tta 


tac 


gcg 


ctt 


gac 


tta 


4369 




267 


Arg 


Arg 


Ser 


Cys 


Pro 


Gly 


Met 


Gin 


Leu 


Gly 


Leu 


Tyr 


Ala 


Leu 


Asp 


Leu 






268 


455 










460 










465 










470 






270 


gcc 


gtg 


get 


cat 


ata 


tta 


cat 


tgc 


ttc 


acg 


tgg 


aaa 


tta 


cct 


gat 


ggg 


4417 




271 


Ala 


Val 


Ala 


His 


He 


Leu 


His 


Cys 


Phe 


Thr 


Trp 


Lys 


Leu 


Pro 


Asp 


Gly 






272 










475 










480 










485 








274 


atg 


aaa 


cca 


agt 


gag 


etc 


gac 


atg 


aat 


gat 


gtg 


ttt 


ggt 


etc 


acg 


get 


4465 




275 


Met 


Lys 


Pro 


Ser 


Glu 


Leu 


Asp 


Met 


Asn 


Asp 


Val 


Phe 


Gly 


Leu 


Thr 


Ala 






276 








490 










495 










500 










278 


cct 


aaa 


gcc 


acg 


egg 


ctt 


ttc 


gcc 


gtg 


cca 


acc 


acg 


cgc 


etc 


ate 


tgt 


4513 




279 


Pro 


Lys 


Ala 


Thr 


Arg 


Leu 


Phe 


Ala 


Val 


Pro 


Thr 


Thr 


Arg 


Leu 


He 


Cys 






280 






505 










510 










515 












282 


get 


ctt 


taagtttatg gttcgagtca cgtggcaggg ggtttggtat 


ggtgaaaact 


4569 




283 


Ala 


Leu 


































284 




520 


































286 


gaaaagtttg aagttgccct catcgaggat 


ttgtggatgt 


catatgtatg tatgtgtata 


4629 




288 


cacgtgtgtt ctgatgaaaa cagatttggc 


tctttgtttg 


cccttttttt ttttttcttt 


4689 




290 


aatggggatt ttccttgaat gaaatgtaac 


agtaaaaata 


agattttttt caataagtaa 


4749 




292 


tttagcatgt tgeaaagate gatcttggat 


gagaacttct 


acttaaaaaa aaaaaaaaaa 


4809 




294 


ttttttttta gttatttcac ctttttcttt 


tgttctggtt 


gtatggttgc cattgtgtca 


4869 
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296 


attaggggct 


jgaagttcgc tggttaaggc taaatcagag 


ttaaagttat aattttacaa 


4 929 




298 


gcccaacc 


laa 


aggtcgcaga ttaaaaccac atgatattta 


taaaaaaaat tctaaggttt 


4 989 




300 


ttattagttt 


tattttcagt ttactgagta ctatttactt 


ttttattttt tgcaaataaa 


5049 




302 


tgtattttat 


satatttatg ttttttgtta taaactccaa 


acatacaggt ttcattacct 


5109 




304 


aaaaaaagac 


agagtggttt cgttaatttt gtttcattaa 


tctcgag 








5156 




307 


<210> SEQ ID NO 


2 




























308 


<211> LENGTH: 520 




























309 


<212> TYPE: 


PRT 






























310 


<213> ORGANISM: 


Arabidopsis 


thaliana 


















312 


<400> SEQUENCE: 


2 




























314 


Met 


Glu 


Ser 


Ser 


He 


Ser 


Gin 


Thr 


Leu 


Ser 


Lys 


Leu 


Ser 


Asp 


Pro 


Thr 






315 


1 








5 










10 










15 








318 


Thr 


Ser 


Leu 


Val 


He 


Val 


Val 


Ser 


Leu 


Phe 


He 


Phe 


He 


Ser 


Phe 


He 




vy 


319 








20 










25 










30 










322 


Thr 


Arg 


Arg 


Arg 


Arg 


Pro 


Pro 


Tyr 


Pro 


Pro 


Gly 


Pro 


Arg 


Gly 


Trp 


Pro 




; !,y 


323 






35 










40 










45 










O 


326 


He 


He 


Gly 


Asn 


Met 


Leu 


Met 


Met 


Asp 


Gin 


Leu 


Thr 


His 


Arg 


Gly 


Leu 




rfi 


327 




50 










55 










60 














330 


Ala 


Asn 


Leu 


Ala 


Lys 


Lys 


Tyr 


Gly 


Gly 


Leu 


Cys 


His 


Leu 


Arg 


Met 


Gly 




- 


331 


65 










70 










75 










80 






334 
335 


Phe 


Leu 


His 


Met 


Tyr 
85 


Ala 


Val 


Ser 


Ser 


Pro 
90 


Glu 


Val 


Ala 


Arg 


Gin 
95 


Val 






338 


Leu 


Gin 


Val 


Gin 


Asp 


Ser 


Val 


Phe 


Ser 


Asn 


Arg 


Pro 


Ala 


Thr 


He 


Ala 






339 








100 










105 










110 










342 


He 


Ser 


Tyr 


Leu 


Thr 


Tyr 


Asp 


Arg 


Ala 


Asp 


Met 


Ala 


Phe 


Ala 


His 


Tyr 






343 






115 










120 










125 












346 


Gly 


Pro 


Phe 


Trp 


Arg 


Gin 


Met 


Arg 


Lys 


Val 


Cys 


Val 


Met 


Lys 


Val 


Phe 






347 




130 










135 










140 














350 


Ser 


Arg 


Lys 


Arg 


Ala 


Glu 


Ser 


Trp 


Ala 


Ser 


Val 


Arg 


Asp 


Glu 


Val 


Asp 






351 


145 










150 










155 










160 






354 


Lys 


Met 


Val 


Arg 


Ser 


Val 


Ser 


Cys 


Asn 


Val 


Gly 


Lys 


Pro 


He 


Asn 


Val 






355 










165 










170 










175 








358 


Gly 


Glu 


Gin 


He 


Phe 


Ala 


Leu 


Thr 


Arg 


Asn 


He 


Thr 


Tyr 


Arg 


Ala 


Ala 






359 








180 










185 










190 










362 


Phe 


Gly 


Ser 


Ala 


Cys 


Glu 


Lys 


Gly 


Gin 


Asp 


Glu 


Phe 


He 


Arg 


He 


Leu 






363 






195 










200 










205 












366 


Gin 


Glu 


Phe 


Ser 


Lys 


Leu 


Phe 


Gly 


Ala 


Phe 


Asn 


Val 


Ala 


Asp 


Phe 


He 






367 




210 










215 










220 














370 


Pro 


Tyr 


Phe 


Gly 


Trp 


He 


Asp 


Pro 


Gin 


Gly 


He 


Asn 


Lys 


Arg 


Leu 


Val 






371 


225 










230 










235 










240 






374 


Lys 


Ala 


Arg 


Asn 


Asp 


Leu 


Asp 


Gly 


Phe 


He 


Asp 


Asp 


He 


He 


Asp 


Glu 






375 










245 










250 










255 








378 


His 


Met 


Lys 


Lys 


Lys 


Glu 


Asn 


Gin 


Asn 


Ala 


Val 


Asp 


Asp 


Gly 


Asp 


Val 






379 








260 










265 










270 










382 


Val 


Asp 


Thr 


Asp 


Met 


Val 


Asp 


Asp 


Leu 


Leu 


Ala 


Phe 


Tyr 


Ser 


Glu 


Glu 






383 






275 










280 










285 












386 


Ala 


Lys 


Leu 


Val 


Ser 


Glu 


Thr 


Ala 


Asp 


Leu 


Gin 


Asn 


Ser 


He 


Lys 


Leu 






387 




290 










295 










300 














390 


Thr 


Arg 


Asp 


Asn 


He 


Lys 


Ala 


He 


He 


Met 


Asp 


Val 


Met 


Phe 


Gly 


Gly 
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SEQUENCE LISTING 

<110> Ellis, David D 

Chappie, Clinton C S 
Gilbert, Margarita 

<12 0> Modification of Lignin Composition of Gymnosperms 

<130> 41193-B 

<140> 09/890,604 

<141> 2001-08-01 

<150> US 60/118,124 

<151> 1999-02-01 

<150> PCT/CAOO/00074 

<151> 2000-01-31 

<160> 2 

<170> Patentln version 3.1 

<210> 1 

<211> 5156 

<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<221> CDS 

<222> (2487) . . (3002) 
<223> 



<220> 

<221> CDS 

<222> (3902) . . (4519) 
<223> 



<220> 

<221> CDS 

<222> (3131) . . (3556) 
<223> 



<400> 1 
aagcttatgt 


atttccttat 


aaccatttta 


ttctgtatat 


agggggacag 


aaacataata 


60 


agtaacaaat 


agtggtttta 


tttttttaaa 


tatacaaaaa 


ctgtttaacc 


attttatttc 


120 


ttggttagca 


aaattttgat 


atattcttaa gaaactaata 


ttttaggttg 


atatattgca 


180 


gtcactaaat 


agttttaaaa 


gacacgaagt 


tggtaagaac 


aggcatatat 


tattcgattt 


240 


aattaggaat 


gcttatgtta 


atctgattcg 


actaattaga 


aacgacgata 


ctatgagctc 


300 



atagatggtc 


ccacgaccca 


ctctcccatt 


tgatcaatat 


tcaactgagc 


aatgaaacta 


360 


attaaaaacg 


tggttagatt 


aaaaaaataa 


attgtgcagg 


tagcggatat 


ataatactag 


420 


taggggttaa 


aaataaaata 


aaacaccaca 


gtattaaatt 


tttgtttcaa 


aagtattatc 


480 


aatagttttt 


ttgcttcaaa 


aatatcacaa 


atttttgtat 


gaaatatttc 


tttaacgaaa 


540 


ataaattaaa 


taaaatttaa 


aatttatatt 


tggagttcta 


tttttaattt 


agagttttta 


SOO 


ttgttaccac 


attttttgaa 


ttattctaat 


attaatttgt 


gatattatta 


caaaaagtaa 


660 


aaatatgata 


ttttagaata 


ctattatcga 


tatttgatat 


tattgacctt 


agctttgttt 


720 


gggtggagac 


atgtgattat 


cttattacct 


ttttattcca 


tgaaactaca 


gagttcgcca 


780 


ggtaccatac 


atgcacacac 


cctcgtgaag 


ccgtgactta 


atatgatcta 


gaacttaaat 


840 


agtactacta 


attgtgtcat 


ttgaactttc 


tcctatgtcg 


gtttcacttc 


atgtatcgca 


900 


gaacaggtgg 


aatacagtgt 


ccttgagttt 


cacccaaatc 


ggtccaattt 


tgtgatatat 


960 


attgcgatac 


agacatacag 


cctacagagt 


tttgtcttag 


cccactggtt 


ggcaaacgaa 


1020 


attgtcttta 


tttttttatg 


ttttgttgtc 


aatgtgtctt 


tgtttttaac 


tagattgagg 


1080 


tttaatttta 


atacatttgt 


tagtttacag 


attatgcagt 


gtaatctgat 


aatgtaagtt 


1140 


gaactgcgtt 


ggtcaaagtc 


ttgtgtaacg 


cactgtatct 


aaattgtgag 


taacgacaaa 


1200 


ataattaaaa 


ttaaaggacc 


ttcaagtatt 


attagtatct 


ctgtctaaga 


tgcacaggta 


1260 


ttcagtaata 


gtaataaata 


attacttgta 


taattaatat 


ctaattagta 


aaccttgtgt 


1320 


ctaaacctaa 


atgagcataa 


atccaaaagc 


aaaaatctaa 


acctaactga 


aaaagtcatt 


1380 


acgaaaaaaa 


gaaaaaaaaa 


agagaaaaaa 


ctacctgaaa 


agtcatgcac 


aacgttcatc 


1440 


ttggctaaat 


ttatttagtt 


tattaaatac 


aaaaatggcg 


agtttctgga 


gtttgttgaa 


1500 


aatatatttg 


tttagccact 


ttagaatttc 


ttgttttaat 


ttgttattaa 


gatatatcga 


1560 


gataatgcgt 


ttatatcacc 


aatatttttg 


ccaaactagt 


cctatacagt 


catttttcaa 


1620 


cagctatgtt 


cactaattta 


aaacccactg 


aaagtcaatc 


atgattcgtc 


atatttatat 


1680 


gctcgaattc 


agtaaaatcc 


gtttggtata 


ctatttattt 


cgtataagta 


tgtaattcca 


1740 


ctagatttcc 


ttaaactaaa 


ttatatattt 


acataattgt 


tttctttaaa 


agtctacaac 


1800 


agttattaag 


ttataggaaa 


ttatttcttt 


tatttttttt 


tttttttagg 


aaattatttc 


1860 


ttttgcaaca 


catttgtcgt 


ttgcaaactt 


ttaaaagaaa 


ataaatgatt 


gttataattg 


1920 


attacatttc 


agtttatgac 


agattttttt 


tatctaacct 


ttaatgtttg 


tttccctgtt 


1980 


tttaggaaaa 


tcataccaaa 


atatatttgt 


gatcacagta 


aatcacggaa 


tagttatgac 


2040 



caagattttc aaagtaatac ttagaatcct attaaataaa cgaaatttta ggaagaaata 2100 

atcaagattt taggaaacga tttgagcaag gatttagaag atttgaatct ttaattaaat 2160 

attttcattc ctaaataatt aatgctagtg gcataatatt gtaaataagt tcaagtacat 2220 

gattaatttg ttaaaatggt tgaaaaatat atatatgtag attttttcaa aaggtatact 2280 

aattattttc atattttcaa gaaaatataa gaaatggtgt gtacatatat ggatgaagaa 2340 

atttaagtag ataatacaaa aatgtcaaaa aaagggacca cacaatttga ttataaaacc 2400 

tacctctcta atcacatccc aaaatggaga actttgcctc ctgacaacat ttcagaaaat 2460 



atcgaatcc aaaaaaaaca ctcaat atg gag tct tct ata tea caa aca eta 
Met Glu Ser Ser lie Ser Gin Thr Leu 



age aaa eta tea gat ccc acg acg tct ctt gtc ate gtt gtc tct ctt 
Ser Lys Leu Ser Asp Pro Thr Thr Ser Leu Val lie Val Val Ser Leu 



ttc ate ttc ate age ttc ate aca egg egg cga agg cct cca tat cct 
Phe He Phe He Ser Phe He Thr Arg Arg Arg Arg Pro Pro Tyr Pro 



ccc ggt cca cga ggt tgg ccc ate ata ggc aac atg tta atg atg gac 
Pro Gly Pro Arg Gly Trp Pro He He Gly Asn Met Leu Met Met Asp 



caa etc acc cac cgt ggt tta gee aat tta get aaa aag tat ggc gga 
Gin Leu Thr His Arg Gly Leu Ala Asn Leu Ala Lys Lys Tyr Gly Gly 



ttg tgc cat etc cgc atg gga ttc etc cat atg tac get gtc tea tea 
Leu Cys His Leu Arg Met Gly Phe Leu His Met Tyr Ala Val Ser Ser 



ccc gag gtg get cga caa gtc ctt caa gtc caa gac age gtc ttc teg 

Pro Glu Val Ala Arg Gin Val Leu Gin Val Gin Asp Ser Val Phe Ser 

90 95 100 105 

aac egg cct gca act ata get ata age tat ctg act tac gac cga gcg 

Asn Arg Pro Ala Thr He Ala He Ser Tyr Leu Thr Tyr Asp Arg Ala 
110 115 120 

gac atg get ttc get cac tac gga ccg ttt tgg aga cag atg aga aaa 

Asp Met Ala Phe Ala His Tyr Gly Pro Phe Trp Arg Gin Met Arg Lys 
125 130 135 

gtg tgt gtc atg aag gtg ttt age cgt aaa aga get gag tea tgg get 

Val Cys Val Met Lys Val Phe Ser Arg Lys Arg Ala Glu Ser Trp Ala 
140 145 150 



2513 



tea gtt cgt gat gaa gtg gac aaa atg gtc egg teg gtc tct tgt aac 
Ser Val Arg Asp Glu Val Asp Lys Met Val Arg Ser Val Ser Cys Asn 



2993 



155 160 165 

gtt ggt aag ctacttcaca tattcaccac tcttgctata tatatgtgca 3042 

Val Gly Lys 

170 

attaaacaaa tatgtaaaaa gtgaaagtac tcatttcttc tttctttagt atgtacttta 3102 

acatttaacc aaaacaattg taggtaag cct ata aac gtc ggg gag caa att 3154 
Pro lie Asn Val Gly Glu Gin lie 
175 180 



ttt gca ctg acc cgc aac ata act tac egg gca gcg ttt ggg tea gec 3202 
Phe Ala Leu Thr Arg Asn lie Thr Tyr Arg Ala Ala Phe Gly Ser Ala 
185 190 195 

tgc gag aag gga caa gac gag ttc ata aga ate tta caa gag ttc tct 3250 
Cys Glu Lys Gly Gin Asp Glu Phe lie Arg He Leu Gin Glu Phe Ser 
200 205 210 

aag ctt ttt gga gec ttc aac gta gcg gat ttc ata cca tat ttc ggg 3298 
Lys Leu Phe Gly Ala Phe Asn Val Ala Asp Phe He Pro Tyr Phe Gly 
215 220 225 

tgg ate gat ccg caa ggg ata aac aag egg etc gtg aag gec cgt aat 3 34 6 

Trp He Asp Pro Gin Gly He Asn Lys Arg Leu Val Lys Ala Arg Asn 
230 235 240 

gat eta gac gga ttt att gac gat att ate gat gaa cat atg aag aag 3394 
Asp Leu Asp Gly Phe He Asp Asp He He Asp Glu His Met Lys Lys 
245 250 255 260 

aag gag aat caa aac get gtg gat gat ggg gat gtt gtc gat acc gat 3442 
Lys Glu Asn Gin Asn Ala Val Asp Asp Gly Asp Val Val Asp Thr Asp 
265 270 275 

atg gtt gat gat ctt ctt get ttt tac agt gaa gag gee aaa tta gtc 3490 
Met Val Asp Asp Leu Leu Ala Phe Tyr Ser Glu Glu Ala Lys Leu Val 
280 285 290 

agt gag aca gcg gat ctt caa aat tec ate aaa ctt acc cgt gac aat 3538 
Ser Glu Thr Ala Asp Leu Gin Asn Ser He Lys Leu Thr Arg Asp Asn 
295 300 305 

ate aaa gca ate ate atg gtaattatat ttcaaaaagc actagtcata 3586 
He Lys Ala He He Met 
310 

gtcatgtttc ttaatgcgtt aegtaataat acttatccat tgaccagtta ttttctccta 3646 

agtttttttg tttgaattag gaaggtaatt ttctatttta ctagagaaag caacagattt 3706 

tagcatgatc tttttttaat atatatagaa gcattgaata ttcagatcta caataattat 3766 

gaaactaatg aagagacaaa aaatggagag agaaaaaaga aagagtggac tagtgtggat 3 82 6 

atatttaatt ctaatttgat tttattagga cgttatattt aattctaatt tgattttttt 3886 



atttgatttt attag gac gtt atg ttt gga gga acg gaa acg gta gcg teg 
Asp Val Met Phe Gly Gly Thr Glu Thr Val Ala Ser 
315 320 325 

gcg ata gag tgg gec tta acg gag tta tta egg age ccc gag gat eta 
Ala lie Glu Trp Ala Leu Thr Glu Leu Leu Arg Ser Pro Glu Asp Leu 
330 335 340 

aaa egg gtc caa caa gaa etc gee gaa gtc gtt gga ctt gac aga cga 
Lys Arg Val Gin Gin Glu Leu Ala Glu Val Val Gly Leu Asp Arg Arg 
345 350 355 

gtt gaa gaa tec gac ate gag aag ttg act tat etc aaa tgc aca etc 
Val Glu Glu Ser Asp lie Glu Lys Leu Thr Tyr Leu Lys Cys Thr Leu 
360 365 370 

aaa gaa acc eta agg atg cac cca ccg ate cct etc etc etc cac gaa 
Lys Glu Thr Leu Arg Met His Pro Pro lie Pro Leu Leu Leu His Glu 
375 380 385 390 

acc gcg gag gac act agt ate gac ggt ttc ttc att ccc aag aaa tct 
Thr Ala Glu Asp Thr Ser lie Asp Gly Phe Phe lie Pro Lys Lys Ser 
395 400 405 

cgt gtg atg ate aac gcg ttt gec ata gga cgc gac cca acc tct tgg 
Arg Val Met He Asn Ala Phe Ala He Gly Arg Asp Pro Thr Ser Trp 
410 415 420 

act gac ccg gac acg ttt aga cca teg agg ttt ttg gaa ccg ggc gta 
Thr Asp Pro Asp Thr Phe Arg Pro Ser Arg Phe Leu Glu Pro Gly Val 
425 430 435 

ccg gat ttc aaa ggg age aat ttc gag ttt ata ccg ttc ggg teg ggt 
Pro Asp Phe Lys Gly Ser Asn Phe Glu Phe He Pro Phe Gly Ser Gly 
440 445 450 

cgt aga teg tgc ccg ggt atg caa eta ggg tta tac gcg ctt gac tta 
Arg Arg Ser Cys Pro Gly Met Gin Leu Gly Leu Tyr Ala Leu Asp Leu 
455 460 465 470 

gec gtg get cat ata tta cat tgc ttc acg tgg aaa tta cct gat ggg 
Ala Val Ala His He Leu His Cys Phe Thr Trp Lys Leu Pro Asp Gly 
475 480 485 

atg aaa cca agt gag etc gac atg aat gat gtg ttt ggt etc acg get 
Met Lys Pro Ser Glu Leu Asp Met Asn Asp Val Phe Gly Leu Thr Ala 
490 495 500 

cct aaa gec acg egg ctt ttc gee gtg cca acc acg cgc etc ate tgt 
Pro Lys Ala Thr Arg Leu Phe Ala Val Pro Thr Thr Arg Leu He Cys 
505 510 515 

get ctt taagtttatg gttcgagtca cgtggcaggg ggtttggtat ggtgaaaact 
Ala Leu 
520 
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Met Glu Ser Ser lie Ser Gin Thr Leu Ser Lys Leu Ser Asp Pro Thr 
15 10 15 



Thr Ser Leu Val lie Val Val Ser Leu Phe He Phe He Ser Phe He 
20 25 30 



Thr Arg Arg Arg Arg Pro Pro Tyr Pro Pro Gly Pro Arg Gly Trp Pro 
35 40 45 



He He Gly Asn Met Leu Met Met Asp Gin Leu Thr His Arg Gly Leu 
50 55 60 



Ala Asn Leu Ala Lys Lys Tyr Gly Gly Leu Cys His Leu Arg Met Gly 
65 70 75 80 



Phe Leu His Met Tyr Ala Val Ser Ser Pro Glu Val Ala Arg Gin Val 
85 90 95 



Leu Gin Val Gin Asp Ser Val Phe Ser Asn Arg Pro Ala Thr He Ala 
100 105 110 



He Ser Tyr Leu Thr Tyr Asp Arg Ala Asp Met 



Ala Phe Ala His Tyr 



115 



120 



125 



Gly Pro Phe Trp Arg Gin Met Arg Lys Val Cys Val Met Lys Val Phe 
130 135 140 



Ser Arg Lys Arg Ala Glu Ser Trp Ala Ser Val Arg Asp Glu Val Asp 
145 150 155 160 



Lys Met Val Arg Ser Val Ser Cys Asn Val Gly Lys Pro He Asn Val 
165 170 175 



Gly Glu Gin He Phe Ala Leu Thr Arg Asn lie Thr Tyr Arg Ala Ala 
180 185 190 



Phe Gly Ser Ala Cys Glu Lys Gly Gin Asp Glu Phe He Arg He Leu 
195 200 205 



Gin Glu Phe Ser Lys Leu Phe Gly Ala Phe Asn Val Ala Asp Phe He 
210 215 220 



Pro Tyr Phe Gly Trp lie Asp Pro Gin Gly He Asn Lys Arg Leu Val 
225 230 235 240 



Lys Ala Arg Asn Asp Leu Asp Gly Phe He Asp Asp He He Asp Glu 
245 250 255 



His Met Lys Lys Lys Glu Asn Gin Asn Ala Val Asp Asp Gly Asp Val 
260 265 270 



Val Asp Thr Asp Met Val Asp Asp Leu Leu Ala Phe Tyr Ser Glu Glu 
275 280 285 



Ala Lys Leu Val Ser Glu Thr Ala Asp Leu Gin Asn Ser He Lys Leu 
290 295 300 



Thr Arg Asp Asn He Lys Ala He He Met Asp Val Met Phe Gly Gly 
305 310 315 320 



Thr Glu Thr Val Ala Ser Ala He Glu Trp Ala Leu Thr Glu Leu Leu 
325 330 335 



Arg Ser Pro Glu Asp Leu Lys Arg Val Gin Gin Glu Leu Ala Glu Val 
340 345 350 



Val Gly Leu Asp Arg Arg Val Glu Glu Ser Asp He Glu Lys Leu Thr 
355 360 365 



Tyr Leu Lys Cys Thr Leu Lys Glu Thr Leu Arg Met His Pro Pro He 
370 375 380 



Pro Leu Leu Leu His Glu Thr Ala Glu Asp Thr Ser He Asp Gly Phe 
385 390 395 400 



Phe He Pro Lys Lys Ser Arg Val Met He Asn Ala Phe Ala He Gly 
405 410 415 



Arg Asp Pro Thr Ser Trp Thr Asp Pro Asp Thr Phe Arg Pro Ser Arg 
420 425 430 



Phe Leu Glu Pro Gly Val Pro Asp Phe Lys Gly Ser Asn Phe Glu Phe 
435 440 445 



He Pro Phe Gly Ser Gly Arg Arg Ser Cys Pro Gly Met Gin Leu Gly 
450 455 460 



Leu Tyr Ala Leu Asp Leu Ala Val Ala His He Leu His Cys Phe Thr 
465 470 475 480 



Trp Lys Leu Pro Asp Gly Met Lys Pro Ser Glu Leu Asp Met Asn Asp 
485 490 495 



Val Phe Gly Leu Thr Ala Pro Lys Ala Thr Arg Leu Phe Ala Val Pro 
500 505 510 



Thr Thr Arg Leu He Cys Ala Leu 
515 520 
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MODIFICATION OF LIGNIN COMPOSITION JDF GYMNOSPERMS 
TECHNICAL FIELD 

5 This invention relates to the modification of the fignin composition of 

gymnosperm species, particularly conifer trees, to make such species more 
suitable for commercial exploitation. 

BACKGROUND ART 

10 

Lignin is a cell wall component present in vascular plants that decreases the 
permeability of cells, contributes to the strength and rigidity of the stem, and 
protects microfibrils from chemical, physical, and biological attack (Bugos et 
ai. 1991 [4]). [Note: for full details of references mentioned herein, see the 

\ 5 section below headed REFERENCES, the numbers provided in square 
brackets corresponding to the numbers in that section.] Despite its 
advantage to the plant, lignin greatly affects the agro-industrial uses of 
plants. Lignin content and composition alter the digestibility and dietary 
conversion of herbaceous crops and are undesirable in the conversion of 

20 wood into paper and pulp (Campbell and Sederoff 1996 [6]). Although lignin 
can contribute up to 25% of the mass of wood, from a pulp and paper 
viewpoint, lignin does not contribute to the usable biomass in pulping and 
hence is waste. More importantly, the extraction of lignin during chemical 
pulping is a costly and difficult process, involving chemical removal. There is 

25 a negative correlation between the amount of lignin removed and fiber yield 
with chemical pulping. Therefore, because the removal of lignin from fibers 
is a major cost, the modification of both lignin content and composition is a 
~ major focus of several research establishments world wide. Of importance is 
that trees with altered lignin, either decreased content or modified 

30 composition to reduce the energy needed to extract the lignin, could allocate 
more resources to the production of pulpable biomass with decreased costs. 
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Chemically, lignin is a highly complex network of phenylpropanoid units 
derived from the oxidative polymerization of one or more of three monolignol 
precursors which are the end products of the three major branches of the 

5 phenylpropanoid pathway (as shown in Figure 1 of the accompanying 
drawings, introduced the section below headed BRIEF DESCRIPTION OF 
THE DRAWINGS). As shown in the figure, branch 1 of the pathway yields 
the monolignol p-coumaryl alcohol which makes up the p-hydroxyphenyl 
residue when polymerized into lignin and is present in both angiosperms and 

io gymnosperms. Branch 2 yields the monolignol con iferyl alcohol which 

makes up the guaiacyl residues when polymerized into lignin and is present 
in both angiosperms and gymnosperms, yet is the predominant monolignol in 
gymnosperms. Branch 3 yields sinapyl alcohol which makes up the syringyl 
residues when polymerized into lignin and is present only in angiosperms, 

1 5 with very few exceptions. These exceptions include reports of syringyl lignin 
in the gymnosperm species Podocarpus and in some species of the 
Gnetales. However, these exceptions are considered rare and are usually 
not even mentioned in reviews on lignin biosynthesis. 

20 The presence of syringyl residues in angiosperm lignin via branch 3 in the 
phenylpropanoid pathway accounts for angiosperm lignin being easier to 
remove during pulping than gymnosperm lignin. One reason syringyl-lignin 
is easier to remove during pulping, as compared to guaiacyl-lignin produced 
by gymnosperms, is that the C-5 carbon of the phenyl ring in syringyl-lignin is 

25 protected by methoxylation from forming a C5-C5 bond with adjacent 
monolignol phenyl rings. Once formed, this C-C bond is very difficult to 
break during delignification and the presence of these bonds accounts for 
the fact that gymnosperm lignin is harder to pulp than angiosperm lignin. 
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The inventors of the present invention theorized that if the phenylpropanoid 
pathway in gymnosperms could be modified such that gymnosperm plants 
could produce lignin containing syringyl residues, via branch 3, or a 
modification thereof, of the phenylpropanoid pathway, this would be of great 
5 benefit because significant reductions in the pulping costs associated with 
lignin removal in gymnosperms would be enabled. 

However, this requires the creation of an entirely new pathway in 
gymnosperms, i.e., the creation of the enzymes and substrates in 
io gymnosperm species to enable the branch 3 phenylpropanoid pathway 
synthesis of syringyl-lignin to proceed through to completion. This is quite 
different in concept from arranging for over-expression of a gene in an 
existing metabolic pathway, which is likely to shuttle more metabolites 
through the pathway, provided other steps do not become limiting. 

15 

There are numerous reports on the modification of the phenylpropanoid 
pathway by genetic engineering. One example is the "sense" suppression of 
PAL by a bean PAL2 gene in tobacco. These experiments demonstrated 
that PAL activity becomes rate-limiting to lignin deposition when levels are 3- 

20 to 4-fold lower than in wild-type plants (Bate et al. 1994 [2]). While PAL may 
hold promise for use in engineered lignin modification, it has been suggested 
that due to its key role in general phenylpropanoid metabolism, the 
interruption of PAL synthesis would also affect other biochemical pathways. 
In contrast, the activity of CAD, an enzyme well downstream in the lignin 

25 biosynthetic pathway, can be reduced to 10% of normal levels and still have 
no effect on the quantity of lignin, although clear qualitative differences are 
observed (Halpin et al. 1994 [12]). From these and other studies on the 
manipulation by genetic engineering of key enzymes in the lignin 
biosynthetic pathway (OMT (Dwivedi et al. 1994 [10]; Ml et al. 1994 [16]), 

30 F5H (Bell-Lelong et al. 1 997, [3]), and peroxidase (Lagrimini et al. 1990 
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[14])), it is clear that lignin modification is possible. However, such studies 
also highlight how extremely difficult it is to achieve a change in lignin 
composition and how it is even more difficult to achieve a change that has 
commercial relevance. In the CAD antisense work, Halpin et al. [12] 
5 reported increased lignin extractability in only 2% of the transformed lines 
tested. In other words, 98% had no change despite morphological changes 
such as the appearance of red xylem. 

Therefore, these disclosures do not specifically relate to techniques involving 
10 genetic engineering to create a lignin which is unique to the plants of 
interest, i.e. gymnosperms. Firstly, all the published work on the genetic 
engineering of plants for altered lignin has been done in angiosperms and 
was done to manipulate an existing endogenous enzyme and biochemical 
pathway. Even with this, the results were variable, and changing lignin 
]5 parameters to a level such that they had commercial advantages was 

difficult. Secondly, the only example of lignin modification in gymnosperms 
where a gene for a specific enzyme in the phenylpropanoid pathway was 
down-regulated occurred in a naturally occurring mutant which had virtually 
no CAD activity (for a review see Whetton et al. 1998 [17]). In this case, 
20 genetic engineering was not used and the regulation was again dependent 
on natural mutation which altered the expression of an endogenous gene. 

International patent application PCT/US96/20094, published on July 3, 1997 
as WO 97/23599, in the name of Clint Chappie as inventor, and assigned 
25 jointly to E.I. Du Pont De Nemours and Company, and Purdue Research 
Foundation, discloses the nucleotide sequence of a gene encoding an F5H 
enzyme, the transformation of the genome of plants with the gene, and the 
resulting modification of lignin composition of the plants. The present 
application builds on this Chappie application and goes beyond, to describe 
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the use of this gene, either alone or in conjunction with other genes, to 
introduce a lignin biosynthetic pathway into gymnosperms. 

DISCLOSURE OF THE INVENTION 

5 

An object of the invention is to modify gymnosperms by genetic engineering 
so that modified gymnosperm plants produce lignin of a type that differs from 
the lignin of wild-type plants of the same species and that is more easily 
accommodated in commercial utilization of such plants. 

10 

Another object of the invention is to modify the lignin precursors in 
gymnosperms to provide modified monolignol residues, and preferably, a 
greater content of syringyl residues, or other residues with a side group at 
the C-5 position of the monolignol ring. 

15 

According to one aspect of the present invention, there is provided a process 
of producing a transformed gymnosperm plant or plant precursor having a 
genome containing at least one expressible transgene that results in 
modification of lignin composition in the gymnosperm plant compared to an 

20 average lignin composition of untransformed wild-type plants of the same 
gymnosperm species, which process comprises: providing a vector 
containing at least one expressible transgene that results in modification of 
the lignin composition in the gymnosperm plant; introducing the vector into 
cells of a gymnosperm plant to produce transformed cells; regenerating 

25 transformed gymnosperm callus or shoots from the transformed cells; 
maturing embryos or plants from the transformed callus or shoots; and 
generating transformed plant embryos, seeds, seedlings or plants from the 
matured embryos. 
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Without wishing to limit the generality of meaning of the term "transgene", we 
should point out that the term is intended to include foreign DNA (transgenic 
or introduced genes) that is introduced into a genome of a gymnosperm 
plant. 

5 

According to another aspect of the invention, there is provided a transformed 
gymnosperm plant or plant precursor having a genome containing at least 
one expressible transgene that results in modification of lignin composition in 
the gymnosperm plant compared to an average lignin composition of 
io untransformed wild-type plants of the same gymnosperm species. 

Most preferably, the lignin of the transformed gymnosperm plant contains 
detectable syringyl residues, or other residues with a side group at the C-5 
position of the monolignol ring, whereas the lignin of the wild-type plants 
is contains no detectable syringyl residues or other residues with a side group 
at the C-5 position of the monolignol ring. 

Preferably, the expressible transgenes are genes that code for enzymes 
required for the lignin biosynthetic pathway, and more preferably the third 

20 branch of the pathway by which branch 2 intermediates are converted to 
sinapyl alcohol. It is therefore to be noted that, in the present invention, at 
least in its preferred forms, gymnosperm plants are being genetically 
engineered with genes which encode at least one enzyme that is not 
normally present in these plants, thereby creating a branch to an existing 

25 pathway in gymnosperm plants. The invention therefore differs considerably 
from prior art procedures that have merely involved the modification of 
existing pathways in angiosperm plants utilizing enzymes already present in 
the wild-type plants. 
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Most preferably, the transgene(s) introduced into the gymnosperm plants 
includes a ferulate 5-hydroxylase gene, or a transgene that is substantially 
homologous to said ferulate 5-hydroxylase gene, or a transgene that has an 
equivalent function, either alone or in conjunction with other genes needed 
5 for the biosynthesis of a lignin, i.e. that results in a lignin composition 

containing syringyl residues. By a "gene that is substantially homologous to 
said ferulate 5-hydroxylase gene", we mean a gene which can be shown to 
have ferulate 5-hydroxylase activity in yeast or having at least 50% 
homology, and more preferably at least 75% homology, to the F5H gene 
10 while exhibiting an ability to modify the lignin content of the gymnosperm 
plant in vivo. 

The ferulate 5-hydroxylase gene (or equivalent gene) either alone or in 
conjunction with other genes, are normally operably linked with at least one 
is regulatory sequence, e.g. cauliflower mosaic virus 35S promoter, a promoter 
for a phenylalanine ammonia lyase gene, a promoter for a p-coumaryl CoA 
ligase gene, a promoter for cinnamate 4-hydroxylase or other plant 
promoters capable of controlling expression of plant genes. 

20 The gymnosperm plants produced by the present invention are preferably 
from the order coniferaies. Thus, they may be from the Picea species (e.g. 
Picea glauca, Picea sitchesis, or Picea engeimanii), or from the Pinus 
species (e.g. Pinus taeda or Pinus radiata). 

25 According to another aspect of the invention, there is provided a biomass 
derived from a genetically transformed gymnosperm plant, said biomass 
containing lignin having syringyl residues, or other residues with a side 
group at the C-5 position of the monolignol ring, and said transformed plant 
having an untransformed (wild-type) natural plant whose lignin contains no 

30 syringyl residues. 
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A still further aspect of the invention relates to a method of producing 
cellulose-containing pulp useful for papermaking and the like, which 
comprises a lignin-containing biomass derived from a gymnosperm plant to 
5 produce pulped mass containing lignin, and removing most of said lignin 
from said pulped mass, characterized in that said gymnosperm plant is a 
genetically transformed plant producing lignin containing syringyl residues or 
other residures with a side group at the C-5 position of the monolignol ring. 

10 As will be appreciated from the above, the present invention is capable of 
producing transformed gymnosperm plants having a modified lignin content 
that makes gymnosperm plants more attractive on a commercial and 
industrial scale. 

15 BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a diagram showing the basic lignin biosynthetic pathway, the 
enzyme abbreviations being as described in this application, and the 
suggested induced pathway(s) being highlighted (the inserted box indicates 

20 the numbering in the phenyl ring). Note the three branches of the 
phenylpropanoid pathway (labeled 1), 2) and 3)): branch 1 yields the 
monolignol p-coumaryl alcohol, present in some angiosperms and 
gymnosperms; branch 2 yields the monolignol coniferyl alcohol, which is 
present in both angiosperms and gymnosperms yet is the predominant 

25 monolignol in gymnosperms; and branch 3 yields sinapyl alcohol 

predominant and present only in angiosperms, with very few exceptions. 

Figure 2 is a graph showing the mean height growth in 1997 from three 
different transformed lines derived from two different parental genotypes of 
so F5H-transformed and control (non-transformed) interior spruce somatic 
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seedlings (note F5H 2d-1 and 2d-2 are two replicate sets of somatic 
seedlings planted 2 weeks apart); 

Figure 3 shows the result of a PCR amplification of a 750 bp arabidopsis 
5 F5H fragment using the primer pair cc8/cs278 from 14 Putative Transformed 
Lines (lanes 1-14); a Non-Transformed 11026 Negative Control (lane 15); 
a Plasmid only p482-F5H = pCC87Positive Control (lane 16); a Plasmid only 
pBIC-F5H = pBIC20F5H, Control (lane 17); a Blank, no DNA Negative 
Control (lane 18); and Molecular Weight (MW) Markers (lane 19); and 

10 

Figure 4 shows the genomic nucleotide and amino acid sequences of a 
known Arabidopsis F5H gene and the F5H enzyme that it encodes (as 
disclosed in Chappie, WO 97/23599). 

15 BEST MODES FOR CARRYING OUT THE INVENTION 

As previously noted, the difficulty in lignin removal in all plants is due to the 
variety of linkages formed between monolignol precursors during lignin 
polymerization, which linkages account for lignin polymers being highly 

20 heterogeneous. This heterogeneity in lignin and the linkages formed during 
polymerization have a large influence on the pulping characteristics of wood. 
For example, the presence of the C-5 methoxylated syringyl residues make 
hardwood lignin easier to hydrolyze during pulping, while a higher proportion 
of condensed p-hydroxy phenyl residues makes softwood hydrolysis more 

25 difficult (Campbell and Sederoff 1996 [6]). This is due in part to the 
unprotected C-5 group characteristic of p-hydroxy phenyl and guaiacyl 
residues typical of softwoods, accounting for the relatively slower 
delignification rate of the softwoods (Chiang and Funaoka 1990 [8]). 
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Any modification of this complex polymer requires an understanding of the 
metabolic pathway. Fortunately, many steps in the lignin biosynthetic 
pathway are well understood (Davin et al. 1992[9]). The basic 
phenylpropanoid pathway is shown in accompanying Figure 1 . The pathway 
5 begins with the conversion of L-phenylalanine to cinnamic acid, by 

phenylalanine ammonia-lyase (PAL) followed by the conversion of cinnamate 
to 4-coumarate by cinnamate 4-hydroxylase (C4H). 4-Coumarate has 
several potential metabolic fates and these account for pathways to the three 
monolignol precursors. Thus, 4-coumarate can enter into one of the three 
10 monolignol branches of the phenylpropanoid pathway shown in Figure 1 as 
follows: 

Branch 1) Present in both angiosperms and gymnosperms where 
4-coumarate is activated to 4-coumaryl-CoA in a reaction 
15 catalyzed by 4-coumaryl-CoA ligase (4CL), and reduced by 

hydroxycinnamyl-CoA reductase (CCR) and cinnamyl alcohol: 
NAD oxidoreductase (CAD) to 4-hydroxycinnamyl alcohol (p 
-coumaryl alcohol), the first of the three monomeric lignin 
precursors; 

20 

Branch 2) Present in both angiosperms and gymnosperms where 

4-coumerate is either: A) 3-hydroxylated and 3-O-methylated to 
form ferulic acid, followed by activation by 4CL, and reduced by 
CCR and CAD to yield 3-methoxy-4-hydroxycinnamyl alcohol 
25 (coniferyl alcohol), the major lignin precursor in conifers; or B) 

activated to 4-coumarly-CoA which is subsequently 3- 
hydroxylated and 3-O-methylated to form feruloyl-CoA, which is 
then reduced by both CCR and CAD to yield 3-methoxy-4- 
hydroxycinnamyl alcohol; and 



30 
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Branch 3) Present only in angiosperms where 4-coumarate is modified as 
in branch 2); however, either: A) feruiic acid undergoes a ring- 
hydroxyiation by ferulate 5-hydroxylase (F5H) and 
O-methylation by an O-methyltransferase (OMT) to generate 
5 sinapic acid, which is reduced to yield sinapyl alcohol, the 

source of the syringyl residues typical in angiosperms; or B) 
proceeds through branch 2 to coniferaldehyde and then to 5- 
hydroxyconiferaldehyde to sinapaldehyde to sinapyl alcohol; or 
C) proceeds through branch 2 to coniferaldehyde and then to 
10 5-hydroxyconiferyl alcohol to sinapyl alcohol. 

The present invention involves the genetic engineering of gymnosperms to 
introduce one or more functional genes encoding one or more enzymes that 
results in a modification of the lignin composition of a gymnosperm plant that 

15 makes the gymnosperm plant or plant products more commercially desirable. 
The modification of gymnosperms with genes for any of the enzymes 
capable of affecting the phenylpropanoid pathway is within the scope of the 
invention, provided such genes modify the lignin composition of gymnosperm 
plants to make the plants more commercially desirable. Preferably, the 

20 transgene creates a Branch 3 metabolic pathway, or other residues with a 
side group at the C-5 position of the monolignol ring, and most preferably 
one of the genes encodes ferulate 5-hydroxylase (F5H). As noted above, 
this enzyme is thought to be absent in most gymnosperms (with few 
exceptions) and is one of the key enzymes missing in conifers which 

25 accounts for the difference between angiosperm and gymnosperm iignin 
(Campbell and Sederoff 1996 [6]). The exceptions are very few as 
previously noted and include reports of syringyl lignin in the non-coniferafes 
gymnosperm species Podocarpus and in some species of the Gnetales. 
These exceptions do not, however, detract from the invention as the vast 
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number of gymnosperms do not produce syringyl lignin and these exceptions 
are mentioned in a very minor way, if at all in the literature on the subject. 

The inventors have been successful in expressing the F5H gene in spruce (a 
5 gymnosperm) and have transformed lines containing this transgene in 
conjunction with other transgenes in the lignin biosynthetic pathway. Since 
the inventors have demonstrated expression of the F5H gene in spruce, they 
believe that its expression in other gymnosperm species is predictable since 
this clearly shows that not only the F5H gene can be expressed in 
10 gymnosperms, but also that its expression can modify lignin in plants which 
do not contain a pathway for syringyl lignin. 

Although not conclusive, support that this single enzyme (F5H), either alone 
or in conjunction with other enzymes, will alter gymnosperm lignin comes 

is from the fact that mutants of the angiosperm Arabidopsis which lack this 
enzyme produce lignin similar in composition to gymnosperms (Chappie et 
al. 1992), suggesting the lack of this one enzyme, alone or in conjunction 
with other enzymes, can account for the difference in lignin composition in an 
angiosperm where a branch of the phenylpropanoid pathway to guaiacyl- 

20 containing lignin already exists. 

As noted above, the F5H gene is known and described, e.g. in PCT 
publication WO 97/23599 on July 3, 1997. The disclosure of this publication 
is specifically incorporated herein by reference. For convenience, the 
25 nucleotide sequence of the F5H gene from Arabidopsis and the amino acid 
sequence of the F5H enzyme is shown in Figure 4 of the accompanying 
drawings. 

The F5H gene can be obtained from an angiosperm species, e.g. 
30 Arabidopsis thaliana, DNA either by polymerase chain reaction (PCR) using 
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primers designed to the 5' and 3 1 ends of the published F5H sequence in 
Figure 4, or by plasmid rescue of the fa/?1 mutant and complementation as 
was done by Meyer et al. (1996[15j). The PCR amplified product can then 
be used to identify the native gene from either a genomic or cDNA library 
5 and the gene can be subsequently cloned by standard gene cloning 

techniques. The isolation of the gene by PCR or from the faM mutant is 
believed to be within the competence of any person skilled in the art, so that 
further explanation is unnecessary. Similar techniques can and have been 
used to isolate other genes in the lignin biosynthetic pathway which can be 
10 used in conjunction with an F5H gene to modify lignin in gymnosperms. 

Several constructs of the F5H gene were obtained as explained in the PCT 
publication mentioned above. These constructs include either genomic and 
cDNA F5H genes controlled by a cauliflower mosaic virus (CaMV) 35S or 
15 cinnimate 4-hydroxylase (C4H) promoter, as well as a C4H-GUS construct to 
test the expression pattern of the C4H promoter, as well as an OMT 
construct used in conjunction with F5H and a construct containing an F5H- 
OMT translational fusion. These and other constructs used in this invention 
are listed below: 



20 



pGA482-F5H = pCC87 



a pGA482-based vector containing 
a CaMV 35S-genomic F5H 



construct; 



pBIC-F5H = pBIC20-F5H 



a pBIC20-based vector containing 
an 18kb genomic fragment 
containing both the F5H promoter 
and coding region; 



25 



pCC98 



a pBI121-based vector containing a 
CaMV 35S-cDNA F5H construct; 
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pCC223 a pBI1 01 -based vector containing a 

C4H-GUS construct for xylem 
directed expression of GUS; 
PC4H-F5H = pCC1 53 a pGA482-based vector containing 

5 a C4H-genomic F5H construct; and 

PCC99 a pGA482-based vector containing 

a double-CaMV 35S-genomic F5H 
construct. 

parabOMT a pUC based vector containing a 

10 CaMV 35S-OMT construct used for 

co-blasting with pCC99. 
PF5H-OMT a pBINPLUS derived vector 

containing a double-CaMV 35S- 
F5H-OMT translational fusion. 
15 Starting materials for such vectors are in common use for the construction of 
plant transformation vectors and are generally available around the world 
from various labs. The pBI- series is commercially available from Clontech. 
The pGA482 vector is described in 1987 Methods Enzymol 153:292-305 and 
is widely used for plant transformation. The pBIC20is a binary cosmid vector 
20 described by Meyer et a!. 1996, in Genome Mapping in Plants, ed. Paterson, 
A.H. (Landis Biochemical Press, Austin, TX). Construction of the pGA482- 
F5H and pBIC20-F5H plasmids are detailed in Meyer et al., 1996, PNAS, 
93.6869-6874 and both are available from that source (Chappie). The other 
F5H and OMT constructs were made using similar techniques. 

25 

As seen in the PCT publication mentioned above, the CaMV 35S constructs 
have been used successfully to modify lignin content in both Arabidopsis and 
tobacco and were included in the present invention to give ectopic 
expression of the F5H gene in spruce. The C4H promoter constructs should 
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direct expression to the xytem, the principal target tissue for lignin 
modification. Because the C4H promoter was isolated from an Arabidopsis 
C4H gene, its expression - as well as the expression of the native C4H gene 
- in gymnosperms was previously unknown. The OMT constructs were 
5 included to ensure, if needed, the O-methylation of the 5-hydroxylated 
branch 2 intermediates. 

To initiate transformation experiments, the plasmids were transformed into 
E. coli and were subsequently purified by CsCI gradient centrifugation. Each 
10 plasmid was checked by restriction digest to confirm its identity. Standard 
procedures were used for coating gold particles with the plasmids and for 
microprojectile bombardment of spruce somatic embryos. Regeneration of 
transformed spruce callus was done on a very low level of kanamycin (2- 
5)j.g/ml) and embryo maturation was done using routine protocols for spruce. 

15 

Over 10,000 spruce embryos were blasted with the various constructs and 
over 500 transformed seedlings from over 50 transformed lines have been 
planted in the greenhouse. No abnormal phenotypes or altered growth 
patterns have been detected in any of the transformed embryogenic callus 
20 fines or seedlings. The results of these experiments are summarized in 
Table 1 below. 



25 
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CONSTRUCT 



pCC87 
35S-gF5H 



pBIC20-F5H 
F5H-gF5H 



pCC98 
35S-CF5H 



pCC223 
C4H-GUS 



pCC153 
C4H-gF5H 



pCC99 

2X 35S-gF5H 



pCC99+ 
parabOMT 



Table 1 

Information regarding tested constructs 



# KANAMYCIN # CONFIRMED # CONFIRMED # CONFIRMED # OF LINES 
RESISTANT PCR POSITIVE SOUTHERN NORTHERN WITH 
LINES LINES POSITIVE LINES POSITIVE LINES SEEDLINGS 
REGENERATED 



16 



16 8 5 

inconclusive Inconclusive inconclusive 

2- 2 nd 

Inconclusive 

5 3 ND 



19 
Nd 



Figure 2 of the accompanying drawings shows the mean height growth of 
three different F5H transformed lines from two different parental 
embryogenic genotypes compared with non-transformed somatic seedlings 
over the growing season. Transformed line 11026 2d is represented by two 
lots of somatic seedlings planted two weeks apart. 



A set of six nested primers for the F5H gene were obtained and tested for 
10 amplification of the F5H gene from pCC87.The primer pair consisting of cc8 
and cs278 were used to amplify a band of approximately 750bp from the 
genomic F5H gene. 
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Figure 3 confirms integration by PCR of the Arabidopsis F5H in 14 different 
putative F5H transformed embryogenic callus lines (lanes 1-14). A band of 
approximately 300bp in all lanes including the blank (lane 18) and the non- 

5 transformed 11026 control (lane 15), suggests that this fragment is a non- 
specific amplification product. The band of interest, a 750bp amplification 
product, is very prominent as expected in the pCC87 plasmid only lane 
(lane 16) and is absent from both the blank and the non-transformed control. 
Note the presence of a 750bp band in DNA samples from 1 0 transformed 

10 lines (lanes 1,3,6,7,8,9,11,12,13,14) including the three transformed lines 
which have somatic seedlings in the greenhouse. The absence of the 750bp 
band in the remaining putative transformed lines could indicate that these 
lines are non-transformed escapes, or that the DNA preparation from these 
lines was poor. This later suggestion is supported by the lack of other 

15 background bands in these lanes (lanes 2,4,5,10). 

Northern blot analyses have confirmed strong expression of the F5H gene in 
spruce and Southern blot analysis of transformed lines have conclusively 
confirmed the PCR results for integration of the inserted DNA into the spruce 
20 genome (Table 1). 
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CLAIMS: 

1 . A process of producing a transformed gymnosperm plant or plant 
precursor having a genome containing at least one expressible 
transgene that results in modification of lignin composition in the 
gymnosperm plant compared to an average lignin composition of 
untransformed wild type plants of the same gymnosperm species, 
which process comprises: 

providing a vector containing at least one expressible transgene 
that results in modification of the lignin composition in the gymnosperm 
plant; 

introducing said vector into cells of a gymnosperm to produce 
transformed cells; 

regenerating transformed gymnosperm callus or shoots from the 
transformed cells; 

maturing embryos from the transformed callus or shoots; and 

generating transformed plant embryos, seeds, seedlings or 
plants from the matured embryos or shoots, 

2. A process according to claim 1, characterized in that said vector is 
provided with said at least one expressible transgene that encodes at 
least one enzyme affecting the phenylpropanoid pathway leading to 
the synthesis of lignin. 

3. A process according to claim 1 , characterized in that said vector is 
provided with sard expressible transgene that encodes at least one 
enzyme enabling the production of sinapyl alcohol or other residues 
with a side group at the C-5 position of a monoligno) ring during the 
biosynthesis of lignin. 
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4. A process according to claim 1 , characterized in that said vector is 
provided with said at least one expressible transgene that encodes at 
least one enzyme enabling the production of lignin containing syringyl 
residues or other residues with a side group at the C-5 position of a 
monolignol ring. 

5. A process according to claim 4, characterized in that said vector Is 
provided with an expressible transgene encoding ferulate 5- 
hydroxylase, or a transgene that has substantially equivalent function 
to said ferulate 5-hydroxylaee gene, either alone or in conjunction with 
other genes involved in fignin biosynthesis. 

6. A process according to claim 5, characterized in that one of the said 
substantially homologous gene has at least 50% homology with said 
ferulate 5-hydroxylase gene. 

7. A process according to claim .5, characterized in that said substantially 
homologous gene has at least 75% homology with said ferulate 5- 
hydroxyfase gene. 

8. A process according to any preceding claim, characterized in that said 
gymnosperm plant is from the order coniferales. 

9. A process according to any preceding claim, characterized in that said - 
gymnosperm plant is from the species Picea. 



10. A process according to claim 9, characterized in that said plant is 
Picea gleuca, Picea sitchesis, or Picea engelmanii. 
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11. A process according to any one of claims 1 to 8, characterized In that 
said gymnosperm plant Is from the species Pinus. 

12. A process according to claim 1 1 , characterized in that said 
gymnosperm plant is Pinus taeda or Pinus radiata. 

13. A process according to claim 5, characterized in that said ferulate 5- 
hydroxylase gene is operably finked with at least one regulatory 
sequence. 



14. A process according to claim 13, characterized in that said regulatory 
sequence is cauliflower mosaic virus 35S promoter, a promoter for a 
phenylalanine ammonia lyase gene, a promoter for a p-coumaryl CoA 
iigase gene, a promoter for cinnamate 4-hydroxylase, or another plant 
promoter capable of controlling expression of plant genes. 

1 5. A transformed gymnosperm plant or plant precursor having a genome 
containing at least one expressible transgene that results in 
modification of lignin composition in the gymnosperm plant compared 
to an average lignin composition of untransformed wild type plants of 
the same gymnosperm species. 



16. A gymnosperm plant or plant precursor according to claim 15, 

characterized in that said plant has a genome containing at least one 
expressible transgene that encodes at least one enzyme enabling the 
production of sinapyl alcohol, or other residue with a side group at the 
C-5 position of a monolignof ring, during the biosynthesis of lignin. 



17. A gymnosperm plant or precursor according to claim 1 5, characterized 
in that the plant or plant precursor has a genome containing an 
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expressible transgene that results in a lignln composition containing 
syringyl residues, or other residue with a side group at the C-5 position 
of a monolignol ring. 

1 8. A gymnosperm plant or precursor according to claim 15, characterized 
in that said at least one express/bie transgene is a gene encoding 
ferulate 5-hydroxylase, or a transgene that has substantially equivalent 
function to said ferulate 5-hydroxylase gene, either alone or in 
conjunction with other genes involved in lignin biosynthesis. 

1 9. A gymnosperm plant or precursor according to claim 1 8, characterized 
in that said substantially homologous gene has at least 50% homology 
with said ferulate 5-hydroxylase gene. 

20. A gymnosperm plant or precursor according to claim 16, characterized 
in that said substantially homologous gene has at least 75% homology 
with said ferulate 5-hydroxylase gene. 

21 . A gymnosperm plant or precursor according to claim 1 5, characterized 
in that said gymnosperm plant is from the order coniferaies. 

22. A gymnosperm plant or precursor according to claim 15, characterized 
in that said gymnosperm plant is from the species Picea. 

23. A gymnosperm plant or precursor according to claim 22, characterized 
in that said plant is Picea gfauca, Picea sitchasis, or Picea engetmanii. 

24. A gymnosperm plant or precursor according to claim 15, characterized 
in that said gymnosperm plant is from the species Pinus. 
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25. A gymnosperm plant or precursor according to claim 24, characterized 
in that said gymnosperm plant is Pinus taeda or Pinus radiata. 

26. A gymnosperm plant or precursor accord ing to claim 1 8, characterized 
in that said ferulate 5-hydroxyiase gene is operably linked with at least 
one regulatory sequence. 

27. A gymnosperm plant or precursor according to claim 26, characterized 
in that said regulatory sequence is a cauliflower mosaic virus 35S 
promoter, a promoter for a phenylalanine ammonia lyase gene, a 
promoter for a p-coumaryl CoA ligase gene, a promoter for cinnamate 
4-hydroxyiase, or any other plant promoter capable of controlling 
expression of plant genes. 

28. A biomass derived from a genetically transformed gymnosperm plant, 
said biomass containing lignin having syringyf residues, or other 
residue with a side group at the C-5 position of a monolignol ring, and 
said transformed plant having an untransformed natural wild-type plant 
whose lignin contains no syringyi residues, or corresponding other 
residues with a side group at the C-5 position of a monolignol ring. 

29. A biomass according to claim 28, resulting from growing and 
harvesting a genetically transformed plant or plant precursor as 
defined in any one of claims i 5 to 27. 

30. A method of producing a cellulose-containing pulp useful for 
papermaking and the like, which comprises finely dividing a lignin- 
containing biomass derived from a gymnosperm plant to produce 
pulped mass containing lignin, and removing at least some of said 
lignin from said pulped mass, characterized in that said gymnosperm 
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plant is a genetically transformed plant producing lignin containing 
syringyl residues or other residues with a side group at the C-5 position 
of a monoJignol ring. 

31 . A process of producing a transformed gymnosperm plant or plant 
precursor having a genome containing at least one expressible 
transgene that results in production of at least one residue of a lignin 
biosynthetic pathway having a hydroxy group at the C-5 position of a 
monolignol ring, which process comprises: 

providing a vector containing at least one expressible transgene 
that results in production of at feast one residue having a hydroxy 
group at the C-5 position of a monolignol ring; 

introducing said vector into cells of a gymnosperm to produce 
transformed cells; 

regenerating transformed gymnosperm callus or shoots from the 
transformed cells; 

maturing embryos from the transformed callus or shoots; and 

generating transformed p\ant embryos, seeds, seedlings or 
plants from the matured embryos or shoots. 

32. A process of producing a transformed gymnosperm plant or plant 
precursor having a genome containing at least one expressible 
transgene encoding an enzyme enabling hydroxylation at the C-5 
position of a monolignol ring of at least one residue in a lignin 
biosynthetic pathway, which process comprises: 

providing a vector containing at least one expressible transgene 
encoding an enzyme enabling hydroxylation at the C-5 position of a 
monolignol ring of at least one residue; 

introducing said vector into cells of a gymnosperm to produce 
transformed cells; 

regenerating transformed gymnosperm callus or shoots from the 
transformed cells; 
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maturing embryos from the transformed callus or shoots; and 
generating transformed plant embryos, seeds, seedlings or 
plants from the matured embryos or shoots. 



33. The process of claim 32. wherein the enzyme is selected from the 
group comprising ferulate-5-hydroxylase and homologs thereof. 

34. A process of producing a transformed gymnosperm plant or plant 
precursor having a genome containing at least one expressible 
transgene that results in hydroxylation of at least one residue of a 
lignin biosynthetic pathway at the C-5 position of a monolignol ring, 
which process comprises: 

providing a vector containing at least one expressible 
transgene that results in hydroxylation of at least one residue at the 
C-5 position of a monolignol ring; 

introducing said vector into cells of a gymnosperm to 
produce transformed cells; 

regenerating transformed gymnosperm callus or shoots from 
the transformed cells; 

maturing embryos from the transformed callus or shoots; and 

generating transformed plant embryos, seeds, seedlings or 
plants from the matured embryos or shoots. 



35. A transformed gymnosperm plant or plant precursor having a 
genome containing at least one expressible transgene that results 
in hydroxylation of at least one residue of a lignin biosynthetic 
pathway at the C-5 position of a monolignol ring. 

36. A transformed gymnosperm plant or plant precursor having a 
genome containing at least one expressible transgene that encodes 
at least one enzyme enabling the production of a residue of a lignin 
biosynthetic pathway with a side group at the C-5 position of a 
monolignol ring, during the biosynthesis of lignin. 
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aagcttatgtatttocttataaocattttattotgtatatagggggaoagaaacataataagtaacaaatagtggttttatttttttaaa 90 
tatacaaaaactgtttaaccattttatttottggttagoaaaattttgatatattcttaagaaactaatattttaggttgatatattgoa 180 
gteactaaatagttttaaaagacacgaagttggtaagaacaggcatatattattcgatttaattaggaatgcttatgttaatctgattcg 270 
actaattagaaacgacgatactatgagctcatagatggtcccacgacccacfcctcccatttgatcaatattcaactgagcaatgaaacta 360 
attaaaaacgtggttagattaaaaaaataaattgtgcaggtagcggatatataatactagtaggggttaaaaataaaataaaacaccaca 450 
gtattaaatttttgtttcaaaagtattatcaatagtttttttgcttcaaaaatatcacaaatttttgtatgaaatatttctttaaogaaa 540 
ataaattaaataaaatttaaaatttatatttggagttotatttttaatttagagtttttattgttaocacattttttgaattattctaat 630 
attaatttgtgatattattacaaaaagtaaaaatatgatattttagaatactattatcgatatttgatattattgaccttagctttgttt 720 
gggtggagacatgtgattatcttattacctttttattccatgaaaetacagagttcgccaggtaccatacatgcacacaccctcgtgaag BIO 
ccgtgacttaatatgatctagaacttaaatagtactaotaafctgtgtcatttgaactttcteotatgtcggtttcacttcatgtatcgca 900 
gaacaggtggaatacagtgtcottgagtttoaccoaaatcggtccaattttgtgatatatattgcgatacagaeataoagcctacagagt 990 
tttgtcttagoccactggttggcaaacgaaattgtctttatttttttatgttttgttgtcaatgtgtctttgtttttaaetagattgagg 1080 
tttaattttaatacatttgttagtttaoagattatgcagtgtaatctgataatgtaagttgaaotgcgttggtoaaagtottgtgtaacg 1170 
cactgtatctaaattgtgagtaacgacaaaataattaaaattaaaggaccttcaagtattattagtatctctgtctaagatgcacaggta 1260 
ttcagtaatagtaataaataattacttgtataattaatatctaattagtaaaccttgtgtctaaacotaaatgagcataaatccaaaagc 1350 
aaaaatctaaacctaactgaaaaagtcattacgaaaaaaagaaaaaaaaaagagaaaaaactacctgaaaagtcatgcacaacgttcatc 1440 
ttggctaaatttatttagtttattaaatacaaaaatggcgagtttctggagtttgttgaaaatatatttgtttagccactttagaattto 1530 
ttgttttaatttgetattaagatatatcgagataatgcgtttatatcaccaatatttttgccaaactagtcctatacagtcatttttcaa 1620 
oagctatgttcaotaatttaaaacccaotgaaagtcaatcatgattcgtcatatttatatgctcgaattcagtaaaatccgtttggtata 1710 
ctatttatttcgtataagtatgtaattocaotagatttccttaaaotaaattatatatttacataattgttttctttaaaagtotacaac 1800 
agttattaagttataggaaattatttcttttatttttttttttttttaggaaattatttcttttgcaaeacatttgtcgtttgcaaactt 1890 
ttaaaagaaaataaatgattgttataattgattacatttcagtttatgacagattttetttatotaacctttaatgtttgtttccotgtt 1980 
tttaggaaaatcataecaaaatatatttgtgatcacagtaaateacggaatagttatgaccaagattttcaaagtaatacttagaatcot 2 070 
attaaataaacgaaattttaggaagaaataatoaagattttaggaaacgatttgagcaaggatttagaagatttgaatctttaattaaat 2160 
attttcattcctaaataattaatgctagtggcataatattgtaaataagttcaagtacatgattaatttgttaaaatggttgaaaaatat 2250 
atatatgtagattttttcaaaaggtatactaattattttoatattttcaagaaaatataagaaatggtgtgtacatatatggatgaagaa 2340 
atttaagtagataatacaaaaatgtcaaaaaaagggaccacacaatttgattataaaacctaccCctctaaecacatcccaaaaTGGAGA 2430 
ACTTTCCCTCCTGACAACATTTGAGAAAATAATCGAATCCAA^ 2520 

MESSISQTLSK 
TATCAGATCCCACGACGTCTCTTGTCATCGTTGTCTCTCTTTTCATCTTCATGAGCTTCATCACACGGCGGCGAAGGCCTCCATATCCTC 2610 
LSDPTTSLVIVVSLFIFISFITRRRRPPYP 
CCGGTCCACGAGGTTGGCCCATCATAGGCAACATGTTAATGATGGACCAACTCACCCACCGTGGTTTAGCCAATTTAGCTAAAAAGTATG 2700 
PGPRGWPIIGNMLMMDOI.THRGLANI.AKXY 
GCGGATTGTGCCATCTCCGCATGGGATTCCTCCATATGTACGCTGTCTCATCACCCGAGGTGGCTCGACAAGTCCTTCAAGTCCAAGACA 2790 
GGLCHLRMGFLHMXAVSSPEVARQVLQVC D 
GCGTCTTCTCGAACCGGCCTGCAACTATAGCTATAAGCTATCTGACTTACGACCGAGCGGACATGGCTTTCGCTCACTACGGACCGTTTT 2880 
SVFSNRPATIAISYLTYDRADMAFAHYGPF 
GGAGACAGATGAGAAAAGTGTGTGTCATGiAGGTGTTTAGCCGTAAAAGAGCTGAGTCATG<MCTTCAGTTCGTGATGAAGTGGACAAAA 2970 
WRQMRXVCVMKVFSRKRAESWASVRDEVDK 
TGGTCCGGTCGGTCTCTTGTAACGTXGGTAAgctacttcacatattcaccactcttgctatatatatgtgcaattaaacaaatatgtaaa 3060 
MVRSVSCNVQK 

aagtaaaagtactoatttcttctttctttagtatgtaotttaacatttaacoaaaacaafctgtaggtaagCCTATAAACGTCGGGGAGCA 3150 

PINVGEQ 

AATTTTTGCACTGACCCGCAACATAACTTACCGGGCAGCGTTTGGGTCAGCCTGCGAGAAGGGACAAGACGAGTTCATAAGAATCTTACA 3240 
IFALTRHITYRAAFGSACEKGQDEFIRILQ 

AGAGTTCTCTAAGCTTTTTGGAGCCTTCAACGTAGCGGATTTCATACCATATTTCGG 3330 
EFSK1.FGAFNVADFIPYFGWIDPQGINKRIJ 

CGTGAAGGCCCGTAATGATCTAGACGGATTTATTGACGATATTATCGATGAACATATGAAGAAGAAGGAGAATCAAAACGCTGTGGATGA 3420 

TGGGGATGTTGTCGATACCGATATGGTTGATGATCTTCTTGCTTTTTACAGTGAAGAGGCCAAATTAGTCAGTGAGACAGCGGATCTTCA 3510 

GDVVDTDMVDDLI.AFYSEEAKI.VSETADLQ 
AAATTCCATO^CTTACCCGTGA(^TATCaAAGCAATCATCATGGtaattatatttcaaaaagoaotagtcatagtcat;gtttcttaa 3600 

tgcgttacgtaataatacttatccattgacoagttattttctcctaagtttttttgtttgaattaggaaggtaattttctattttactag 3690 
agaaageaaoagattttagcatgatctttttttaatatatatagaagcattgaatattcagatctacaataattatgaaactaatgaaga 3780 
gacaaaaaatggagagagaaaaaagaaagagtggactagtgtggatatatttaattctaatttgattttattaggacgttatatttaatt 3870 

ctaatttgatttttttatttgattttattaggACGTTATGTTTGGAGGAACGGAAACGGTAGCGTCGGCGATAGAGTGGGCCTTAACGGA 3960 

DVMFGGTETVASAIEWALTE 
GTTATTACGGAGCCCCGAGGATCTAAAACGGGTCCAACAAGAACTCGCCGAAGTCGTTGGACTTGACAGACGAGTTGAAGAATCCGAaV.T 4050 
DLKRVQQELAEVVGLD - - """"""" 

- __KCTLKETIjRHHPPXPL 
CACTAGTATCGACGGTTTCTTCATTCCCAAGAAATCTCGI'GTGATGATCAACGCGTTTGCCATAGGACGCGACCCAACCTCTTGGACTGA 4230 

TSIDGFFIPKKSRVMINAFAIGRDPTSWTD 
CCCGGACACGTTTAGACCATCGAGGTTTTTGGAACCGGGCGTACCGGATTTCAAAGGGAGCAATTTCGAGTTTATACCGTTCGGGTCGGG 4320 

PDTFRPSRFI.EPGVPDFKGSNFEFIPFGSG 
TCGTAGATCGTGCCCGGGTATGCAACTAGGGTTATACGCGCTTGACTTAGCCGTGGCTCATATATTACATTGCTTCACGTGGAAATTACC 4410 

RRSCPGMQLGLYALDLAVAHILHCFTWKLP 
TGATGGGATGAAACCAAGTGAGCTCGACATGAATGATGTGTTTGGTCTCACGGCTCCTAAAGCCACGCGGCTTTTCGCCGTGCCAACCAC 4500 

DGMXFSELDMNDVFGLTAPKATRLFAVPTT 
GCGCCTCATCTGTGCTCTTTAAGTTTATGGTTCGAGTCACGTGGCAGGGGGTTTGGTATGGTGAAAACTGAAAAGTTTGAAGTTGCCCTC 4590 

ATCGAGGA^TGTGGATGTCATATGTATGTATGTGTATACACGTGTGTTCTGATGAAAACAGATTTGGCTCTTTGTTTGCCCTTTTTTTT 4680 
Tl M l'T T CTTTAATGGGGATTTTCCTTGAATGAAATGTAACAGTAAAAATAAGA' l ' 'r'J."l"TT TCAATAAGTAATTTAGCATGTTGCaaagatcg 4770 
atcttggatgagaacttctacttaaaaaaaaaaaaaaaatttttttttagttatttcacctttttcttttgttctggttgtatggttgcc 4860 
attgtgtcaattaggggctggaagttcgctggttaaggctaaatcagagttaaagttataattttacaagcccaacaaaaggtcgcagat 4950 
taaaaccaeatgatatttataaaaaaaattctaaggtttttattagttttattttcagtttactgagtactatttacttttttatttttt 5040 
gcaaataaatgtattttatoatatttatgttttttgttataaaotooaaaeatacaggtttcattacctaaaaaaagacagagtggtttc 5130 
gttaattttgtttoattaatctcgag 5220 
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